PLINK: a tool set for whole-genome association and population-based linkage analyses.
نویسندگان
چکیده
Whole-genome association studies (WGAS) bring new computational, as well as analytic, challenges to researchers. Many existing genetic-analysis tools are not designed to handle such large data sets in a convenient manner and do not necessarily exploit the new opportunities that whole-genome data bring. To address these issues, we developed PLINK, an open-source C/C++ WGAS tool set. With PLINK, large data sets comprising hundreds of thousands of markers genotyped for thousands of individuals can be rapidly manipulated and analyzed in their entirety. As well as providing tools to make the basic analytic steps computationally efficient, PLINK also supports some novel approaches to whole-genome data that take advantage of whole-genome coverage. We introduce PLINK and describe the five main domains of function: data management, summary statistics, population stratification, association analysis, and identity-by-descent estimation. In particular, we focus on the estimation and use of identity-by-state and identity-by-descent information in the context of population-based whole-genome studies. This information can be used to detect and correct for population stratification and to identify extended chromosomal segments that are shared identical by descent between very distantly related individuals. Analysis of the patterns of segmental sharing has the potential to map disease loci that contain multiple rare variants in a population-based linkage analysis.
منابع مشابه
The Pattern of Linkage Disequilibrium in Livestock Genome
Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...
متن کاملSecond-generation PLINK: rising to the challenge of larger and richer datasets
BACKGROUND PLINK 1 is a widely used open-source C/C++ toolset for genome-wide association studies (GWAS) and research in population genetics. However, the steady accumulation of data from imputation and whole-genome sequencing studies has exposed a strong need for faster and scalable implementations of key functions, such as logistic regression, linkage disequilibrium estimation, and genomic di...
متن کاملIdentification of genomic loci controlling phenologic and morphologic traits in barley (Hordeum vulgare L.) genotypes using association analysis
Association mapping is a technique with high resolution for QTL mapping based on linkage disequilibrium and has shown more promising for describing genetically complex traits. In addition, it is a powerful tool for describing complex agronomic traits and identifying alleles that can contribute to enhance the desired traits. In this study, whole genome association mapping was used in a set of 14...
متن کاملAuthor's response to reviews Title: Non-Replication Study of a Genome-Wide Association Study for Hypertension and Blood Pressure in African Americans Authors:
Major compulsory revisions requested by reviewer: 1. The authors are attempting to replicate a genetic association study. However, the association analysis presented was not a genetic analysis but an epidemiologic analysis based on comparing genotype frequencies and ANOVA between the genotypes. A genetic association analysis (i.e. one that considers genetic models additive, recessive, dominant)...
متن کاملLAMPLINK: detection of statistically significant SNP combinations from GWAS data
One of the major issues in genome-wide association studies is to solve the missing heritability problem. While considering epistatic interactions among multiple SNPs may contribute to solving this problem, existing software cannot detect statistically significant high-order interactions. We propose software named LAMPLINK, which employs a cutting-edge method to enumerate statistically significa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- American journal of human genetics
دوره 81 3 شماره
صفحات -
تاریخ انتشار 2007